An architecture for parallel topic models
نویسندگان
چکیده
منابع مشابه
An Architecture for Parallel Topic Models
This paper describes a high performance sampling architecture for inference of latent topic models on a cluster of workstations. Our system is faster than previous work by over an order of magnitude and it is capable of dealing with hundreds of millions of documents and thousands of topics. The algorithm relies on a novel communication structure, namely the use of a distributed (key, value) sto...
متن کاملScalable Parallel Topic Models
U) The topic model is a popular probabilistic model for text and document modeling. It can be used for topic indexing, document classification, corpus summarization and information retrieval. In the past, topic models have been applied to corpora containing thousands to hundreds of thousands of documents. Now there is an increasing need to model collections with millions to billions of document...
متن کاملModel-Parallel Inference for Big Topic Models
In real world industrial applications of topic modeling, the ability to capture gigantic conceptual space by learning an ultra-high dimensional topical representation, i.e., the so-called “big model”, is becoming the next desideratum after enthusiasms on ”big data”, especially for fine-grained downstream tasks such as online advertising, where good performances are usually achieved by regressio...
متن کاملCommunication-Free Parallel Supervised Topic Models
In this project, we develop a parallel algorithm for supervised latent Dirichlet allocation (sLDA) Mcauliffe & Blei (2008) which maintains the speed advantage of communication free parallel computing in Neiswanger et al. (2013) while at the same time bypassing the problematic quasiergodicity in the local posteriors combination stage. Since the main objective of sLDA is prediction rather than me...
متن کاملThe Topic Browser An Interactive Tool for Browsing Topic Models
Topic models have been shown to reveal the semantic content in large corpora. Many individualized visualizations of topic models have been reported in the literature, showing the potential of topic models to give valuable insight into a corpus. However, good, general tools for browsing the entire output of a topic model along with the analyzed corpus have been lacking. We present an interactive...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the VLDB Endowment
سال: 2010
ISSN: 2150-8097
DOI: 10.14778/1920841.1920931